1 |
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
|
|
|
|
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
|
|
Abstract:
This paper describes the systems submitted by Vocapia Research and LIMSI for the shared task on Code-switched Spoken Language Identification, organized in conjunction with the First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020. Our primary system combines an acoustic approach based on i-vector modeling of audio segments with a phonotactic approach that focuses on sequences of language-independent phone units. The two modeling approaches provided comparable performance, and a simple linear combination of their scores yielded a gain, showing that they are complementary. One of our submissions ranked first for all combinations of tasks and language pairs. For the utterance-level detection task (task A), our combined system obtained an F-measure of 76.0%, with an average accuracy of 83.3% on the development set. For the frame-level detection task, the average accuracy was 81.2% on the development set and 78.7% on the evaluation set. However, a detailed analysis reveals a very high rejection rate for the 200 ms code-switched frames, which comprise only 12% of the corpus. This shows that more precise modeling of code-switched segments is needed for accurate segmentation.
|
|
Keyword:
Computer Science; code-switching; language identification; phonotactic model
|
|
URL: https://hal.archives-ouvertes.fr/hal-03091792/file/code-switching-2020.pdf https://hal.archives-ouvertes.fr/hal-03091792 https://hal.archives-ouvertes.fr/hal-03091792/document
|
|
|
|
2 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
|
|
3 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
|
|
4 |
Language Recognition for Dialects and Closely Related Languages
|
|
|
|
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
|
|
|
|
6 |
Lexical speaker identification in TV shows
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
|
|
|
|
7 |
Traduction de la parole dans le projet RAPMAT
|
|
|
|
In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
|
|
|
|
8 |
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
|
|
|
|
9 |
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
|
|
|
|
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
|
|
|
|
10 |
Lattice MLLR based m-vector system for speaker verification
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
|
|
|
|
11 |
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
|
|
|
|
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
|
|
|
|
12 |
Recherche par le contenu dans des documents audiovisuels multilingues
|
|
|
|
In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
|
|
|
|
13 |
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
|
|
|
|
15 |
Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
|
|
|
|
In: TALN ; TALN 2009, Senlis, 24-26 June 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 June 2009, 2009, Unknown, pp.x-x (2009)
|
|
|
|
16 |
Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
|
|
|
|
In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
|
|
|
|
17 |
Recherche par le contenu dans des documents audiovisuels multilingues
|
|
|
|
In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
|
|
|
|
18 |
Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
|
|
|
|
In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
|
|
|
|
19 |
Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
|
|
|
|
In: Actes JADT 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
|
|
|
|
20 |
Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
|
|
|
|
|
|
|
|